ABSTRACT

The ratio estimator can be justified by a linear superpopulation model without intercept and with error variance proportional to the size of the covariate. If either of the assumptions is violated, then other estimators may be considered. We study the consistency of several estimators which are based on different assumptions about the variance structure of the error. Some decompositions of the finite population are introduced. Roughly speaking, we fit a weighted regression line to the finite population with the weight chosen according to the estimator under consideration. We show that every estimator in that class, except the ratio estimator, is inconsistent unless some conditions on the population's characteristics are satisfied. Based on the decomposition, modifications can be made to get a consistent estimator. For the case of p auxiliary variables, we characterize the class of consistent weighted least squares estimators. The result is extended to the infinite population problem using a completely different approach.


AMS  (MOS)  Subject  Classification:  62D05 

Key  Words:  Weighted  regression  estimator;  Consistency;  Ratio  estimator; 
SIGNIFICANCE AND EXPLANATION

In estimating the population mean of a character of interest, we often make use of an auxiliary covariate which is already available. The ratio estimator is one of the commonly used estimators in survey sampling. It can be justified by a linear superpopulation model with certain error structure assumptions. If some of the assumptions are violated, then other estimators may be considered. We study the consistency of several estimators which are based on different assumptions about the variance structure of the error. Some decompositions of the finite population are introduced.

We show that the ratio estimator is the only consistent estimator in the class. Based on the decomposition, the other estimators can be modified to become consistent. For the case of p auxiliary variables, we show how to choose the right weight for the weighted regression estimators to be consistent. The result is extended to the infinite population situation.
CHARACTERIZATION OF THE CONSISTENT WEIGHTED REGRESSION ESTIMATORS

Lih-Yuan Deng*

1.  Introduction 

Consider a finite population consisting of $N$ units with values $(y_i, x_i)$, where $x_i$ is positive and known for $1 \le i \le N$. A simple random sample of size $n$ is chosen without replacement from the population. Denote the sample and population means of $y$ and $x$ by $\bar y$, $\bar x$ and $\bar Y$, $\bar X$ respectively. The ratio estimator

\[ \hat{\bar y}_R = \frac{\bar y}{\bar x}\,\bar X \]

is the most commonly used estimator of $\bar Y$. It is the best linear unbiased predictor of $\bar Y$ under the following superpopulation model (Brewer, 1963; Royall, 1970)

\[ y_i = \beta x_i + \varepsilon_i, \qquad E_M(\varepsilon_i) = 0, \qquad E_M(\varepsilon_i \varepsilon_j) = \begin{cases} \sigma^2 w_i & \text{if } i = j \\ 0 & \text{if } i \ne j \end{cases} \tag{1.1} \]

with $w_i = x_i$, where $E_M$ denotes expectation with respect to the superpopulation model (1.1).

If the variance of $\varepsilon_i$ is not proportional to $x_i$, one may consider the best linear unbiased estimator (BLUE) of $\bar Y$

\[ \hat{\bar y}_w = \frac{\sum_{i=1}^{n} x_i y_i / w_i}{\sum_{i=1}^{n} x_i^2 / w_i}\,\bar X = \hat\beta_w \bar X, \tag{1.2} \]

where $\hat\beta_w$ is the weighted least squares estimator of $\beta$ in (1.1). In particular, for $w_i = 1$, we have

\[ \hat{\bar y}_c = \frac{\sum_{i=1}^{n} x_i y_i}{\sum_{i=1}^{n} x_i^2}\,\bar X. \tag{1.3} \]

For $w_i = x_i$, we have $\hat{\bar y}_R$. For $w_i = x_i^2$, we have

\[ \hat{\bar y}_h = \frac{1}{n} \sum_{i=1}^{n} \frac{y_i}{x_i}\,\bar X. \tag{1.4} \]
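The three special cases of (1.2) can be checked numerically. The following is a minimal sketch, assuming a small made-up sample and a made-up known population mean (`xs`, `ys`, `X_BAR` and the function name `weighted_estimate` are illustrative, not from the paper).

```python
from typing import Sequence


def weighted_estimate(xs: Sequence[float], ys: Sequence[float],
                      ws: Sequence[float], x_pop_mean: float) -> float:
    """Return beta_w * X_bar with beta_w = sum(x*y/w) / sum(x^2/w), as in (1.2)."""
    num = sum(x * y / w for x, y, w in zip(xs, ys, ws))
    den = sum(x * x / w for x, w in zip(xs, ws))
    return num / den * x_pop_mean


# Hypothetical sample and known population mean of x (illustrative values only).
xs = [1.0, 2.0, 4.0]
ys = [2.0, 3.0, 9.0]
X_BAR = 2.5

y_c = weighted_estimate(xs, ys, [1.0] * len(xs), X_BAR)       # w_i = 1,     eq. (1.3)
y_r = weighted_estimate(xs, ys, xs, X_BAR)                    # w_i = x_i,   ratio estimator
y_h = weighted_estimate(xs, ys, [x * x for x in xs], X_BAR)   # w_i = x_i^2, eq. (1.4)
print(y_c, y_r, y_h)
```

With $w_i = x_i$ the weights cancel and the function reduces to $(\bar y / \bar x)\bar X$; with $w_i = x_i^2$ it reduces to the mean of the ratios $y_i / x_i$ times $\bar X$.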

Royall (1970) took a different approach to the problem of estimating $\bar Y$. He showed that

\[ \hat{\bar y}_w^{*} = f\,\bar y + (1 - f)\,\hat\beta_w\,\bar x_r \tag{1.5} \]

is the best linear unbiased predictor (in contrast to estimator) of $\bar Y$ under superpopulation model (1.1), where $f = n/N$ is the sampling fraction, and

\[ \bar x_r = (N \bar X - n \bar x)/(N - n). \]

Since $(1 - f)\,\bar x_r = \bar X - f \bar x$, it is easy to see that the difference between the two is

\[ \hat{\bar y}_w^{*} - \hat{\bar y}_w = f\,(\bar y - \hat\beta_w \bar x). \tag{1.6} \]

From (1.6) one can easily see that

\[ \hat{\bar y}_w^{*} - \hat{\bar y}_w \to 0 \quad \text{as } n \to \infty \]

if either model (1.1) is true or $f \to 0$ as $n \to \infty$. In practice, the sampling fraction is indeed very small. In either case, this implies that $\hat{\bar y}_w^{*}$ and $\hat{\bar y}_w$ are asymptotically equivalent. The same remark applies to $\hat{\bar y}_c$ and $\hat{\bar y}_h$.
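The identity between Royall's predictor and the estimator $\hat\beta_w \bar X$ can be verified directly. This is a minimal sketch under assumed toy values (the sample, the population size 10 and the population mean 2.5 are hypothetical, as is the function name `royall_gap`).

```python
def royall_gap(xs, ys, ws, x_pop_mean, pop_size):
    """Return (predictor - estimator, f * (y_bar - beta_w * x_bar))."""
    n = len(xs)
    f = n / pop_size                                  # sampling fraction
    beta_w = (sum(x * y / w for x, y, w in zip(xs, ys, ws))
              / sum(x * x / w for x, w in zip(xs, ws)))
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    # Mean of x over the non-sampled units.
    x_rest = (pop_size * x_pop_mean - n * x_bar) / (pop_size - n)
    predictor = f * y_bar + (1 - f) * beta_w * x_rest
    estimator = beta_w * x_pop_mean
    return predictor - estimator, f * (y_bar - beta_w * x_bar)


# Hypothetical sample of size 3 from a population of size 10 with X_bar = 2.5.
lhs, rhs = royall_gap([1.0, 2.0, 4.0], [2.0, 3.0, 9.0], [1.0, 1.0, 1.0], 2.5, 10)
print(lhs, rhs)
```

Both returned values agree up to rounding, and both shrink proportionally to the sampling fraction `f`.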

It is well known that $\hat{\bar y}_R$ is a consistent estimator of $\bar Y$. We will first study the asymptotic properties of $\hat{\bar y}_c$ and $\hat{\bar y}_h$ in Sections 2.1 and 2.2. More general results for $\hat{\bar y}_w$ are obtained in Section 2.3. In fact, we can show that $\hat{\bar y}_R$ ($w_i = x_i$) is the only consistent estimator of $\bar Y$ among the $\hat{\bar y}_w$. We also show that $\hat{\bar y}_c$ and $\hat{\bar y}_h$ can be consistent estimators of $\bar Y$, provided that some conditions on the population characteristics are satisfied. Based on these, we can easily modify $\hat{\bar y}_c$ and $\hat{\bar y}_h$ to get consistent estimators of $\bar Y$. A useful decomposition of the finite population is developed. In Section 3, we extend the results in Section 2 to the case of p-dimensional auxiliary variables. Lastly, similar properties are obtained for the case of an infinite population in Section 4. The techniques used there are quite different from those in the previous sections.

The asymptotic behavior of an estimator will be studied under the traditional randomization distribution generated by repeated sampling from a fixed population according to simple random sampling (s.r.s.). Following the usual formulation of asymptotic theory in finite populations (see for example, Scott and Wu, 1981; Isaki and Fuller, 1982), we embed our finite population in a sequence of populations $\{U_v\}$ indexed by $v$, with population size $N_v$ and a simple random sample of size $n_v$, where $0 < N_1 < N_2 < \cdots$ and $U_1 \subset U_2 \subset U_3 \subset \cdots$. Let a sequence of samples $\{s_v\}$ of sizes $\{n_v\}$ be created from the sequence of populations $\{U_v\}$ by a sequence of designs, where $n_1 < n_2 < \cdots$ and $n_v < N_v$ for all $v$. Note that while the sequence of populations is nested, the sequence of samples is not.

Let the population $U_v$ consist of $N_v$ units with values $\{(y_i, x_i),\ i = 1, \ldots, N_v\}$. The population mean $\bar X_v$ of the auxiliary variate is assumed known. $\bar Y_v$ denotes the population mean of the character under study. Let $\bar x_v$, $\bar y_v$ denote the sample means of $x$ and $y$ based on a s.r.s. without replacement (srswo) of size $n_v$ drawn from population $U_v$.


For a general sampling scheme, Isaki and Fuller(1983) gave sufficient conditions for the Horvitz-Thompson estimator to be consistent. For simple random sampling, Lemma 1.1 below is a corollary of Lemma 1 of Isaki and Fuller(1983).

Lemma 1.1. Let $\{z_i\}$ be a fixed sequence satisfying the condition

\[ \frac{1}{N_v} \sum_{i=1}^{N_v} (z_i - \bar Z_v)^2 < M < \infty \quad \text{for all } v. \]

Then

\[ \bar z_v - \bar Z_v = O_p(n_v^{-0.5}), \quad \text{where } \bar z_v = \frac{1}{n_v} \sum_{i=1}^{n_v} z_i \ \text{ and } \ \bar Z_v = \frac{1}{N_v} \sum_{i=1}^{N_v} z_i. \]

Throughout this paper, we will assume that for any "reasonable" function $g$, $z_i = g(x_i, y_i, w_i)$ satisfies the condition of Lemma 1.1.


2.  Asymptotic Behavior of $\hat{\bar y}_c$, $\hat{\bar y}_h$ and $\hat{\bar y}_w$

2.1.  Asymptotic Behavior of $\hat{\bar y}_c$

We can fit the regression line to the finite population $\{(y_i, x_i),\ i = 1, 2, \ldots, N_v\}$:

\[ y_i = \alpha_{c,v} + \beta_{c,v}\,x_i + d_i, \tag{2.1} \]

where

\[ \beta_{c,v} = \frac{\sum_{i=1}^{N_v} (x_i - \bar X_v)(y_i - \bar Y_v)}{\sum_{i=1}^{N_v} (x_i - \bar X_v)^2}, \qquad \alpha_{c,v} = \bar Y_v - \beta_{c,v} \bar X_v, \qquad d_i = y_i - \alpha_{c,v} - \beta_{c,v}\,x_i. \]

It is straightforward to see that $\{d_i\}$ satisfy the following:

\[ \sum_{i=1}^{N_v} d_i = 0, \qquad \sum_{i=1}^{N_v} d_i x_i = 0. \]

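We can use the decomposition in (2.1) to characterize the asymptotic behavior of $\hat{\bar y}_{c,v}$.

Theorem 2.1.

(a) $\hat{\bar y}_{c,v} - \bar Y_v = \alpha_{c,v} \left[ -\dfrac{S_{x,v}^2}{\bar X_v^{(2)}} \right] + O_p(n_v^{-0.5})$.

(b) $\hat{\bar y}_{c,v} - \bar Y_v \xrightarrow{p} 0$, if $\lim_{v \to \infty} \alpha_{c,v} = 0$,

where

\[ \hat{\bar y}_{c,v} = \frac{\sum_{i=1}^{n_v} x_i y_i}{\sum_{i=1}^{n_v} x_i^2}\,\bar X_v, \qquad S_{x,v}^2 = \frac{1}{N_v} \sum_{i=1}^{N_v} (x_i - \bar X_v)^2, \qquad \bar X_v^{(2)} = \frac{1}{N_v} \sum_{i=1}^{N_v} x_i^2. \]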

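Proof. Using (2.1) and (2.6), we have

\[ \hat{\bar y}_{c,v} = \alpha_{c,v}\,\frac{\bar x_v}{\bar x_v^{(2)}}\,\bar X_v + \beta_{c,v} \bar X_v + \frac{\frac{1}{n_v} \sum_{i=1}^{n_v} x_i d_i}{\bar x_v^{(2)}}\,\bar X_v, \quad \text{where } \bar x_v^{(2)} = \frac{1}{n_v} \sum_{i=1}^{n_v} x_i^2. \]

This together with $\bar Y_v = \alpha_{c,v} + \beta_{c,v} \bar X_v$ implies

\[ \hat{\bar y}_{c,v} - \bar Y_v = \alpha_{c,v} \left[ \frac{\bar x_v}{\bar x_v^{(2)}}\,\bar X_v - 1 \right] + \frac{\frac{1}{n_v} \sum_{i=1}^{n_v} x_i d_i}{\bar x_v^{(2)}}\,\bar X_v. \]

From (2.7), $\frac{1}{n_v} \sum_{i=1}^{n_v} x_i d_i = O_p(n_v^{-0.5})$, and $\dfrac{\bar x_v}{\bar x_v^{(2)}}\,\bar X_v = \dfrac{\bar X_v^2}{\bar X_v^{(2)}} + O_p(n_v^{-0.5})$. Since $\bar X_v^2 / \bar X_v^{(2)} - 1 = -S_{x,v}^2 / \bar X_v^{(2)}$, we have Part (a). Part (b) follows from Part (a) and $S_{x,v}^2 / \bar X_v^{(2)} \le 1$. □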


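From Part (a) of Theorem 2.1, the sign of the leading term of $\hat{\bar y}_{c,v} - \bar Y_v$ depends only on the sign of $\alpha_{c,v}$, the intercept of the regression line. Because the bracket $-S_{x,v}^2 / \bar X_v^{(2)}$ is non-positive, if $\alpha_{c,v} > 0$ ($< 0$), then $\hat{\bar y}_{c,v}$ has a negative (positive) bias of constant order. If $\alpha_{c,v} \to 0$ as $v \to \infty$, then $\hat{\bar y}_{c,v}$ is a consistent estimator of $\bar Y_v$. In practice, $\alpha_{c,v}$ is of course unknown. However, we can estimate $\alpha_{c,v}$ consistently by using the sample analog $\hat\alpha_{c,v}$ of $\alpha_{c,v}$ in (2.1). It is then easy to modify $\hat{\bar y}_{c,v}$ to get a consistent estimator of $\bar Y_v$. For example,

\[ \hat{\bar y}_{c,v} + \hat\alpha_{c,v}\,\frac{S_{x,v}^2}{\bar X_v^{(2)}} \quad \text{and} \quad \hat{\bar y}_{c,v} + \hat\alpha_{c,v}\,\frac{s_{x,v}^2}{\bar x_v^{(2)}} \]

are consistent estimators of $\bar Y_v$, where $s_{x,v}^2$, $\bar x_v^{(2)}$ are the sample analogs of $S_{x,v}^2$, $\bar X_v^{(2)}$.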
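The leading term in Theorem 2.1(a) can be illustrated on a toy population. When the whole population is taken as the "sample" (a census, $n_v = N_v$), the $O_p$ remainder vanishes and the gap equals $-\alpha_{c,v} S_{x,v}^2 / \bar X_v^{(2)}$ exactly. The sketch below assumes a made-up population; the values and function names are illustrative only.

```python
def ols_intercept(xs, ys):
    """Intercept alpha_c of the ordinary least squares line fitted to the population."""
    n = len(xs)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    beta = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
            / sum((x - x_mean) ** 2 for x in xs))
    return y_mean - beta * x_mean


def census_gap_c(xs, ys):
    """y_c computed from the full population minus the population mean of y."""
    n = len(xs)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    y_c = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs) * x_mean
    return y_c - y_mean


def leading_term_c(xs, ys):
    """-alpha_c * S_x^2 / X^(2), the leading term in Theorem 2.1(a)."""
    n = len(xs)
    x_mean = sum(xs) / n
    s2 = sum((x - x_mean) ** 2 for x in xs) / n
    x2 = sum(x * x for x in xs) / n
    return -ols_intercept(xs, ys) * s2 / x2


# Hypothetical population (illustrative values only).
pop_x = [1.0, 2.0, 3.0, 5.0]
pop_y = [3.0, 3.0, 6.0, 8.0]
alpha_c = ols_intercept(pop_x, pop_y)
gap_c = census_gap_c(pop_x, pop_y)
lead_c = leading_term_c(pop_x, pop_y)
print(alpha_c, gap_c, lead_c)
```

For this population the intercept is positive and the gap is negative, in line with the sign of the bracket $-S_{x,v}^2 / \bar X_v^{(2)}$.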

2.2.  Asymptotic Behavior of $\hat{\bar y}_{h,v}$

We need a different decomposition of the population for studying the asymptotic behavior of

\[ \hat{\bar y}_{h,v} = \frac{1}{n_v} \left( \sum_{i=1}^{n_v} \frac{y_i}{x_i} \right) \bar X_v. \tag{2.8} \]

Fit a weighted regression line with weight proportional to $x_i^{-1}$ to the finite population $U_v$:

\[ y_i = \alpha_{h,v} + \beta_{h,v}\,x_i + d_i, \tag{2.9} \]

where

\[ d_i = y_i - \alpha_{h,v} - \beta_{h,v}\,x_i. \]

Note that $\{d_i\}$ satisfy the following:

\[ \sum_{i=1}^{N_v} d_i = 0, \qquad \sum_{i=1}^{N_v} \frac{d_i}{x_i} = 0. \]

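Using the above decomposition, we can find the leading term of $\hat{\bar y}_{h,v} - \bar Y_v$.

Theorem 2.2.

(a) $\hat{\bar y}_{h,v} - \bar Y_v = \alpha_{h,v} \left[ \bar X_v \bar X_v^{(-1)} - 1 \right] + O_p(n_v^{-0.5})$.

(b) If $\lim_{v \to \infty} \bar X_v^{(-1)} < \infty$, then $\hat{\bar y}_{h,v} - \bar Y_v \xrightarrow{p} 0$, if $\alpha_{h,v} \to 0$ as $v \to \infty$,

where

\[ \bar X_v^{(-1)} = \frac{1}{N_v} \sum_{i=1}^{N_v} \frac{1}{x_i}. \]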


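Proof. From (2.8) and (2.9), we have

\[ \hat{\bar y}_{h,v} = \alpha_{h,v}\,\bar x_v^{(-1)} \bar X_v + \beta_{h,v} \bar X_v + \left( \frac{1}{n_v} \sum_{i=1}^{n_v} \frac{d_i}{x_i} \right) \bar X_v, \quad \text{where } \bar x_v^{(-1)} = \frac{1}{n_v} \sum_{i=1}^{n_v} \frac{1}{x_i}. \]

This together with $\bar Y_v = \alpha_{h,v} + \beta_{h,v} \bar X_v$ implies

\[ \hat{\bar y}_{h,v} - \bar Y_v = \alpha_{h,v} \left( \bar x_v^{(-1)} \bar X_v - 1 \right) + \left( \frac{1}{n_v} \sum_{i=1}^{n_v} \frac{d_i}{x_i} \right) \bar X_v. \]

Part (a) follows from the above expression, $\bar x_v^{(-1)} = \bar X_v^{(-1)} + O_p(n_v^{-0.5})$ and $\frac{1}{n_v} \sum_{i=1}^{n_v} \frac{d_i}{x_i} = O_p(n_v^{-0.5})$. Part (b) follows immediately from Part (a). □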


From Jensen's inequality, we can see that $\bar X_v \bar X_v^{(-1)} - 1 \ge 0$. Therefore, the sign of the leading term of $\hat{\bar y}_{h,v} - \bar Y_v$ depends only on $\alpha_{h,v}$, the intercept of the weighted regression line. If $\alpha_{h,v} > 0$ ($< 0$), then $\hat{\bar y}_{h,v}$ has a positive (negative) bias of constant order. Theorem 2.2 not only gives a condition for $\hat{\bar y}_{h,v}$ to be consistent but also indicates how to modify $\hat{\bar y}_{h,v}$ to get consistent estimators of $\bar Y_v$.
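The decomposition with weight proportional to $x_i^{-1}$ amounts to solving the two conditions $\sum d_i = 0$ and $\sum d_i / x_i = 0$ for the intercept and slope. The following sketch does this for an assumed toy population (values and names are hypothetical) and checks the census version of Theorem 2.2(a), where the remainder term is exactly zero.

```python
def inverse_x_weighted_fit(xs, ys):
    """Solve sum(d) = 0 and sum(d / x) = 0 for (alpha_h, beta_h), d = y - alpha - beta*x."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    s_inv = sum(1.0 / x for x in xs)
    s_yx = sum(y / x for x, y in zip(xs, ys))
    # Linear system:  n*alpha + sx*beta = sy ;  s_inv*alpha + n*beta = s_yx.
    det = n * n - sx * s_inv          # zero only if all x_i are equal
    alpha = (sy * n - sx * s_yx) / det
    beta = (n * s_yx - s_inv * sy) / det
    return alpha, beta


# Hypothetical population (illustrative values only).
pop_x = [1.0, 2.0, 3.0, 5.0]
pop_y = [3.0, 3.0, 6.0, 8.0]
n_pop = len(pop_x)
x_mean, y_mean = sum(pop_x) / n_pop, sum(pop_y) / n_pop
x_inv_mean = sum(1.0 / x for x in pop_x) / n_pop

alpha_h, beta_h = inverse_x_weighted_fit(pop_x, pop_y)
# Census value of the estimator in (2.8) minus the population mean of y.
gap_h = sum(y / x for x, y in zip(pop_x, pop_y)) / n_pop * x_mean - y_mean
lead_h = alpha_h * (x_mean * x_inv_mean - 1.0)
print(alpha_h, gap_h, lead_h)
```

Because the bracket $\bar X_v \bar X_v^{(-1)} - 1$ is non-negative, the gap carries the same sign as `alpha_h`.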


2.3.  Asymptotic Behavior of $\hat{\bar y}_{w,v}$

To study the asymptotic behavior of

\[ \hat{\bar y}_{w,v} = \frac{\sum_{i=1}^{n_v} x_i y_i / w_i}{\sum_{i=1}^{n_v} x_i^2 / w_i}\,\bar X_v, \tag{2.13} \]

we need the following decomposition of the finite population:

\[ y_i = \alpha_{w,v}\,w_i + \beta_{w,v}\,x_i + d_i, \tag{2.14} \]

where the $d_i$ satisfy the following:

\[ \sum_{i=1}^{N_v} d_i = 0, \qquad \sum_{i=1}^{N_v} \frac{x_i d_i}{w_i} = 0. \tag{2.15} \]

Note that if the vector of $\{w_i\}$ and the vector of $\{x_i\}$ are linearly independent, then $\alpha_{w,v}$, $\beta_{w,v}$ will be uniquely determined under condition (2.15). We can obtain $\alpha_{w,v}$, $\beta_{w,v}$ by fitting the weighted least squares line as in (2.14), with the weight proportional to $w_i^{-1}$. Using the above decomposition we can show the following theorem:


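Theorem 2.3. If $\alpha_{w,v}$, $\beta_{w,v}$ can be uniquely determined in (2.14), then

(a) $\hat{\bar y}_{w,v} - \bar Y_v = \alpha_{w,v} \left[ \dfrac{\bar X_v^2}{N_v^{-1} \sum_{i=1}^{N_v} x_i^2 w_i^{-1}} - \bar W_v \right] + O_p(n_v^{-0.5})$.

(b) $\left[ \dfrac{\bar X_v^2}{N_v^{-1} \sum_{i=1}^{N_v} x_i^2 w_i^{-1}} - \bar W_v \right] \le 0$, for all $N_v$.

(c) If $\lim_{v \to \infty} \bar W_v < \infty$ and $0 < \lim_{v \to \infty} \dfrac{1}{N_v} \sum_{i=1}^{N_v} x_i^2 w_i^{-1} < \infty$, then $\hat{\bar y}_{w,v} - \bar Y_v \xrightarrow{p} 0$, if $\alpha_{w,v} \to 0$ as $v \to \infty$,

where $\bar W_v = \dfrac{1}{N_v} \sum_{i=1}^{N_v} w_i$.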


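Proof. From (2.13) and (2.14), we have

\[ \hat{\bar y}_{w,v} = \alpha_{w,v}\,\frac{\bar x_v}{n_v^{-1} \sum_{i=1}^{n_v} w_i^{-1} x_i^2}\,\bar X_v + \beta_{w,v} \bar X_v + \frac{n_v^{-1} \sum_{i=1}^{n_v} w_i^{-1} x_i d_i}{n_v^{-1} \sum_{i=1}^{n_v} w_i^{-1} x_i^2}\,\bar X_v. \]

From this and $\bar Y_v = \alpha_{w,v} \bar W_v + \beta_{w,v} \bar X_v$, we have

\[ \hat{\bar y}_{w,v} - \bar Y_v = \alpha_{w,v} \left[ \frac{\bar x_v}{n_v^{-1} \sum_{i=1}^{n_v} w_i^{-1} x_i^2}\,\bar X_v - \bar W_v \right] + \frac{n_v^{-1} \sum_{i=1}^{n_v} w_i^{-1} x_i d_i}{n_v^{-1} \sum_{i=1}^{n_v} w_i^{-1} x_i^2}\,\bar X_v. \]

Part (a) follows from the above expression and

\[ \frac{1}{n_v} \sum_{i=1}^{n_v} \frac{x_i d_i}{w_i} = O_p(n_v^{-0.5}). \]

Part (b) follows from the Cauchy-Schwarz inequality. Part (c) follows immediately from Part (a). □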


From Theorem 2.3, we can see that the leading term of $\hat{\bar y}_{w,v} - \bar Y_v$ depends only on $\alpha_{w,v}$, the "intercept" of the regression "line" in (2.14). Moreover, if $\alpha_{w,v} > 0$ ($< 0$), then $\hat{\bar y}_{w,v}$ has negative (positive) bias of constant order. Theorems 2.1 and 2.2 are special cases of Theorem 2.3 with $w_i = 1$ and $w_i = x_i^2$. However, Theorems 2.1 and 2.2 give the explicit formulae for the leading term of the bias of $\hat{\bar y}_c$ and $\hat{\bar y}_h$ in terms of the usual regression decomposition of the population.
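Parts (a) and (b) of Theorem 2.3 can be checked on a toy population for several weight choices. The sketch below assumes a made-up population and solves the two conditions in (2.15) for the coefficients of (2.14); in a census the remainder vanishes, so the gap equals $\alpha_{w,v}$ times the bracket exactly. All names and values are illustrative.

```python
def w_decomposition(xs, ys, ws):
    """Coefficients of y_i = alpha*w_i + beta*x_i + d_i with sum(d) = 0, sum(x*d/w) = 0."""
    sw, sx, sy = sum(ws), sum(xs), sum(ys)
    sxxw = sum(x * x / w for x, w in zip(xs, ws))
    sxyw = sum(x * y / w for x, y, w in zip(xs, ys, ws))
    det = sw * sxxw - sx * sx   # zero when w is proportional to x (ratio estimator case)
    alpha = (sy * sxxw - sx * sxyw) / det
    beta = (sw * sxyw - sx * sy) / det
    return alpha, beta


def census_gap_and_leading_term(xs, ys, ws):
    """Return (y_w - Y_bar, alpha_w * bracket, bracket) computed on the full population."""
    n = len(xs)
    x_mean, y_mean, w_mean = sum(xs) / n, sum(ys) / n, sum(ws) / n
    sxxw = sum(x * x / w for x, w in zip(xs, ws))
    sxyw = sum(x * y / w for x, y, w in zip(xs, ys, ws))
    y_w = sxyw / sxxw * x_mean
    alpha, _ = w_decomposition(xs, ys, ws)
    bracket = x_mean * x_mean / (sxxw / n) - w_mean
    return y_w - y_mean, alpha * bracket, bracket


# Hypothetical population (illustrative values only).
pop_x = [1.0, 2.0, 3.0, 5.0]
pop_y = [3.0, 3.0, 6.0, 8.0]
results = [
    census_gap_and_leading_term(pop_x, pop_y, [1.0] * len(pop_x)),       # w_i = 1
    census_gap_and_leading_term(pop_x, pop_y, [x * x for x in pop_x]),   # w_i = x_i^2
]
print(results)
```

The weight $w_i = x_i$ is deliberately left out: there the vectors $\{w_i\}$ and $\{x_i\}$ are linearly dependent, `det` is zero, and $\alpha_{w,v}$ is not uniquely determined.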

3.  p-dimensional  Auxiliary  Variables 

In Section 2, we assume that only one auxiliary variable ($x$) is available. Estimators like $\hat{\bar y}_{w,v} = \hat\beta_{w,v} \bar X_v$ are justified by the superpopulation model (1.1). The regression estimator is another popular estimator of $\bar Y_v$. It is the best linear unbiased predictor under the following superpopulation model (Royall, 1970)

\[ y_i = \alpha + \beta x_i + \varepsilon_i, \qquad E_M(\varepsilon_i) = 0, \qquad E_M(\varepsilon_i \varepsilon_j) = \begin{cases} \sigma^2 w_i & \text{if } i = j \\ 0 & \text{if } i \ne j \end{cases} \tag{3.1} \]

with $w_i = 1$.

If the constant variance assumption is in question, one may like to consider an estimator of $\bar Y_v$ which is based on the weighted least squares estimator of $\alpha$, $\beta$ with some weight $w_i$. As we have seen in Section 2, not all kinds of weight will give us consistent estimators. As a matter of fact, in Section 2 the only weight we may choose such that $\hat{\bar y}_{w,v}$ is consistent is $w_i = x_i$. We may ask the same question for the weighted regression estimators. More generally, for more than one auxiliary variate, the commonly used estimator is the multiple regression estimator. We may again consider a weighted least squares estimator for p auxiliary variables. We would like to know what kind of weight will result in a consistent estimator. If the weight does not give a consistent estimator for all populations, we would like to characterize the leading term of the bias in terms of some simple population characteristics so that we may modify the estimator to get a consistent estimator. We would like to extend the results in Section 2 to the case of several auxiliary variables. Some notation will be introduced.

Consider a finite population $U_v$ indexed by $v$ with population size $N_v$ and a simple random sample (s.r.s.) of size $n_v$. Let $X_1, X_2, \ldots, X_p$ be the p auxiliary variates which are correlated with the character $Y$. The purpose is again to estimate $\bar Y_v$, the population mean of the variable of interest. Under simple random sampling, a sample $(y_{s_v}, X_{s_v})$ of size $n_v$ is chosen, where $y_{s_v} = (y_{s_1}, y_{s_2}, \ldots, y_{s_{n_v}})'$, $X_{s_v} = (x_{s_1}, x_{s_2}, \ldots, x_{s_{n_v}})'$, and $x_i = (x_{1i}, x_{2i}, \ldots, x_{pi})'$.

Consider a general class of weighted least squares estimators

\[ \hat{\bar y}_{w,v} = \bar X_v (X_{s_v}' W_{s_v}^{-1} X_{s_v})^{-1} X_{s_v}' W_{s_v}^{-1} y_{s_v} = \bar X_v \hat\beta_{w,v}, \tag{3.2} \]

where

\[ W_{s_v} = \mathrm{diag}(w_{s_1}, w_{s_2}, \ldots, w_{s_{n_v}}), \qquad \bar X_v = \frac{1}{N_v} \sum_{i=1}^{N_v} x_i', \]

and $w_i > 0$ is the weight associated with unit $i$.

Note that the ratio estimator is a special case of (3.2), with $p = 1$ and $w_i = x_i$, and the regression estimator is also a special case of (3.2), with $p = 2$, $w_i = 1$, the first column of the $X$ matrix being all 1's and the second column being the $x_i$'s.


An important question one may ask is what kind of weight $w_i$ we can choose such that $\hat{\bar y}_{w,v}$ will be a consistent estimator of $\bar Y_v$. If $\hat{\bar y}_{w,v}$ is not consistent, we would like to find the leading term of $\hat{\bar y}_{w,v} - \bar Y_v$ in terms of some simple population characteristics, so that one can modify $\hat{\bar y}_{w,v}$ to get a consistent estimator of $\bar Y_v$.


To characterize the asymptotic behavior of $\hat{\bar y}_{w,v}$, we need the following decomposition of the finite population $U_v$:

\[ y_v = X_v \beta_v + Z_v \gamma_v + d_v, \tag{3.3} \]

where

\[ Z_v = (w_1, w_2, \ldots, w_{N_v})', \qquad W_v = \mathrm{diag}(w_1, w_2, \ldots, w_{N_v}), \tag{3.4} \]

and

\[ y_v = (y_1, y_2, \ldots, y_{N_v})', \quad d_v = (d_1, d_2, \ldots, d_{N_v})', \quad X_v = (x_1, x_2, \ldots, x_{N_v})', \quad x_i = (x_{1i}, x_{2i}, \ldots, x_{pi})'. \]

Note that $\beta_v$ and $\gamma_v$ will be uniquely determined if $Z_v$ is not in the column space of $X_v$, the dimension of the column space of $X_v$ is $p$, and

\[ X_v' W_v^{-1} d_v = 0 \tag{3.5} \]

and

\[ Z_v' W_v^{-1} d_v = 1_{N_v}' d_v = \sum_{i=1}^{N_v} d_i = 0. \tag{3.6} \]

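Here is our key theorem.

Theorem 3.1. Assume that $(X_v' W_v^{-1} X_v)$ is non-singular for all $v$.

(a) If $1_{N_v} \in \mathrm{col}(W_v^{-1} X_v)$, then

\[ \hat{\bar y}_{w,v} - \bar Y_v = (\bar y_v - \bar Y_v) - (\bar x_v - \bar X_v)\,\beta_{w,v} + O_p(n_v^{-1}), \]

where $1_{N_v} = (1, 1, \ldots, 1)'$. $\hat{\bar y}_{w,v}$ is then obviously a consistent estimator of $\bar Y_v$.

(b) If $1_{N_v} \notin \mathrm{col}(W_v^{-1} X_v)$, then

\[ \hat{\bar y}_{w,v} - \bar Y_v = \left[ \bar X_v \left( \frac{1}{N_v} X_v' W_v^{-1} X_v \right)^{-1} \bar X_v' - \bar W_v \right] \gamma_v + O_p(n_v^{-0.5}), \tag{3.7} \]

where

\[ \left[ \bar X_v \left( \frac{1}{N_v} X_v' W_v^{-1} X_v \right)^{-1} \bar X_v' - \bar W_v \right] \le 0 \quad \text{for all } v. \tag{3.8} \]

Hence $\hat{\bar y}_{w,v} - \bar Y_v \xrightarrow{p} 0$, if $\gamma_v \to 0$ as $v \to \infty$.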

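Proof. Note that

\[ \hat{\bar y}_{w,v} = \bar x_v \hat\beta_{w,v} - (\bar x_v - \bar X_v)\,\hat\beta_{w,v}. \]

The assumption of Part (a) implies $1_{N_v}' = c' X_v' W_v^{-1}$ for some $c$, from which we have

\[ \bar x_v \hat\beta_{w,v} = \frac{1}{n_v}\,c' X_{s_v}' W_{s_v}^{-1} y_{s_v} = \bar y_v. \tag{3.10} \]

Note that

\[ \frac{1}{n_v} (X_{s_v}' W_{s_v}^{-1} X_{s_v}) = \frac{1}{N_v} (X_v' W_v^{-1} X_v) + O_p(n_v^{-0.5}), \qquad \frac{1}{n_v} X_{s_v}' W_{s_v}^{-1} y_{s_v} = \frac{1}{N_v} X_v' W_v^{-1} y_v + O_p(n_v^{-0.5}). \tag{3.11} \]

Since $(X_v' W_v^{-1} X_v)$ is assumed non-singular for all $v$,

\[ \left[ \frac{1}{n_v} (X_{s_v}' W_{s_v}^{-1} X_{s_v}) \right]^{-1} = \left[ \frac{1}{N_v} (X_v' W_v^{-1} X_v) \right]^{-1} + O_p(n_v^{-0.5}), \]

which implies

\[ \hat\beta_{w,v} = \beta_{w,v} + O_p(n_v^{-0.5}), \tag{3.12} \]

where $\beta_{w,v} = (X_v' W_v^{-1} X_v)^{-1} X_v' W_v^{-1} y_v$ is the population analog of $\hat\beta_{w,v}$. Part (a) follows from (3.10) and (3.12). To prove Part (b), we will use the decomposition (3.3). Note that $1_{N_v} \notin \mathrm{col}(W_v^{-1} X_v)$ implies $Z_v = W_v 1_{N_v} \notin \mathrm{col}(X_v)$. Therefore, the coefficients $\beta_v$, $\gamma_v$ in the decomposition (3.3) will be uniquely determined. Using (3.2), (3.3) and $W_{s_v}^{-1} Z_{s_v} = 1_{n_v}$, we have

\[ \hat{\bar y}_{w,v} = \bar X_v \beta_v + \bar X_v (X_{s_v}' W_{s_v}^{-1} X_{s_v})^{-1}\,n_v\,\bar x_v'\,\gamma_v + \bar X_v (X_{s_v}' W_{s_v}^{-1} X_{s_v})^{-1} X_{s_v}' W_{s_v}^{-1} d_{s_v}. \]

From (3.3) and (3.6), it is straightforward to see

\[ \bar Y_v = \bar X_v \beta_v + \bar Z_v \gamma_v + 0 = \bar X_v \beta_v + \bar W_v \gamma_v. \]

Taking the difference, we get

\[ \hat{\bar y}_{w,v} - \bar Y_v = \left[ \bar X_v \left( \frac{1}{n_v} X_{s_v}' W_{s_v}^{-1} X_{s_v} \right)^{-1} \bar x_v' - \bar W_v \right] \gamma_v + \bar X_v \left( \frac{1}{n_v} X_{s_v}' W_{s_v}^{-1} X_{s_v} \right)^{-1} \left( \frac{1}{n_v} X_{s_v}' W_{s_v}^{-1} d_{s_v} \right). \tag{3.13} \]

Using (3.11) and

\[ \frac{1}{n_v} X_{s_v}' W_{s_v}^{-1} d_{s_v} = \frac{1}{N_v} X_v' W_v^{-1} d_v + O_p(n_v^{-0.5}) = O_p(n_v^{-0.5}), \]

it is easy to see that the last term of (3.13) is $O_p(n_v^{-0.5})$. (3.7) follows from (3.13) and $\bar x_v - \bar X_v = O_p(n_v^{-0.5})$. And (3.8) holds because

\[ \bar W_v - \bar X_v \left( \frac{1}{N_v} X_v' W_v^{-1} X_v \right)^{-1} \bar X_v' = \frac{1}{N_v}\,1_{N_v}' \left( W_v - X_v (X_v' W_v^{-1} X_v)^{-1} X_v' \right) 1_{N_v} = \frac{1}{N_v} \left[ c' \left( I - M (M'M)^{-1} M' \right) c \right] \ge 0, \]

where $M = W_v^{-1/2} X_v$, $c = W_v^{1/2} 1_{N_v}$, and $I - M (M'M)^{-1} M'$ is a positive semi-definite matrix. □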

Wright (1983) considered a class of asymptotically design-unbiased (ADU) predictors under a general sampling design; our conclusion in Part (a) is the same as his Theorem 1. However, Part (b) of our Theorem 3.1 uses some simple population characteristics to characterize the leading term of $\hat{\bar y}_{w,v} - \bar Y_v$ when $1_{N_v}$ is not in the column space of $W_v^{-1} X_v$. From Part (b), we can then easily construct some new consistent estimators of $\bar Y_v$. For example, let $\hat\gamma_v$ be the sample analog of $\gamma_v$; then

\[ \hat{\bar y}_{w,v} - \left[ \bar X_v \left( \frac{1}{n_v} X_{s_v}' W_{s_v}^{-1} X_{s_v} \right)^{-1} \bar X_v' - \bar W_v \right] \hat\gamma_v \]

is a consistent estimator of $\bar Y_v$.
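The column-space condition of Theorem 3.1 can be made concrete for $p = 2$ with rows $(1, x_i)$: then $1_{N_v} \in \mathrm{col}(W_v^{-1} X_v)$ exactly when $w_i = c_0 + c_1 x_i$ for some constants. The sketch below, using an assumed toy population and a hand-written 2x2 solve, evaluates the estimator (3.2) on the full population; under the condition it reproduces $\bar Y_v$ exactly, otherwise a gap of constant order remains. All names and values are illustrative.

```python
def census_wls_estimate(xs, ys, ws):
    """X_bar * (X'W^{-1}X)^{-1} X'W^{-1} y for rows (1, x_i), using the whole population."""
    n = len(xs)
    s00 = sum(1.0 / w for w in ws)
    s01 = sum(x / w for x, w in zip(xs, ws))
    s11 = sum(x * x / w for x, w in zip(xs, ws))
    t0 = sum(y / w for y, w in zip(ys, ws))
    t1 = sum(x * y / w for x, y, w in zip(xs, ys, ws))
    det = s00 * s11 - s01 * s01
    b0 = (t0 * s11 - s01 * t1) / det    # weighted least squares intercept
    b1 = (s00 * t1 - s01 * t0) / det    # weighted least squares slope
    return b0 + b1 * sum(xs) / n


# Hypothetical population (illustrative values only).
pop_x = [1.0, 2.0, 3.0, 5.0]
pop_y = [3.0, 3.0, 6.0, 8.0]
y_pop_mean = sum(pop_y) / len(pop_y)

gap_const = census_wls_estimate(pop_x, pop_y, [1.0] * 4) - y_pop_mean                    # w = 1
gap_linear = census_wls_estimate(pop_x, pop_y, [1.0 + 2.0 * x for x in pop_x]) - y_pop_mean  # w = 1 + 2x
gap_square = census_wls_estimate(pop_x, pop_y, [x * x for x in pop_x]) - y_pop_mean      # w = x^2
print(gap_const, gap_linear, gap_square)
```

The first two weights are linear in $(1, x_i)$ and give a zero gap up to rounding; $w_i = x_i^2$ is not, and the gap is visibly non-zero for this population.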

4.  Characterization  of  a  Class  of  Consistent  Estimators  in  Infinite  Population 

To characterize a class of consistent estimators for infinite populations, the techniques used in Section 2 for finite populations are not appropriate. A completely different approach will be taken. The asymptotic framework for the infinite population is the standard one. Therefore, the notation is simpler.

Let $(Y_i, X_i)$, $i = 1, 2, \ldots, n$, be i.i.d. random vectors with distribution $F(Y, X)$, where $Y_i$ is the variable of interest and $X_i$ is a p-dimensional variable. The joint distribution $F$ of $Y$ and $X$ need not be known. The main purpose is to use both $X$ and $Y$ to estimate $E(Y) = \mu_Y$, where $E$ denotes the expectation.

Consider a class of estimators

\[ \hat\mu_Y = \hat\mu_X' (A'V^{-1}A)^{-1} A'V^{-1} Y = \hat\mu_X' \hat\beta_W, \tag{4.1} \]

where $\hat\mu_X$ is any consistent estimator of $E(X)$, i.e.

\[ \hat\mu_X = E(X) + o_p(1), \]

\[ \hat\beta_W = (A'V^{-1}A)^{-1} A'V^{-1} Y, \qquad Y = (Y_1, \ldots, Y_n)', \]

\[ A_{n \times p} = (X_1, X_2, \ldots, X_n)', \qquad X_i = (Z_{1i}, Z_{2i}, \ldots, Z_{pi})', \]

\[ V = \mathrm{diag}(W_1, W_2, \ldots, W_n), \qquad W_i = g(X_i) \ \text{for some function } g. \]

Note that the estimator in (4.1) is appropriate if

\[ Y_i = X_i' \beta + \varepsilon_i, \tag{4.2} \]

where the $\varepsilon_i$ are i.i.d. with mean zero and finite variance. Since $\hat\beta_W$ is the weighted least squares estimator of $\beta$, it can be shown that $\hat\beta_W$ is a consistent estimator of $\beta$ under (4.2), for any choice of weight $W_i$. However, if (4.2) is not the true model, say there should be a $Z$ variable in the model

\[ Y_i = X_i' \beta + Z_i' \gamma + \varepsilon_i, \tag{4.3} \]

then $\hat\beta_W$ will not be consistent for $\beta$ under (4.3), unless $X$ and $Z$ are "orthogonal". But $\hat\mu_Y$ will still be a consistent estimator of $E(Y)$ even if (4.2) is not the true model, provided we choose the "right" weight $W_i$. It is our purpose in this section to characterize the weight $W_i$ such that $\hat\mu_Y$ will be consistent for estimating $E(Y)$.

In the context of survey sampling, the $X$ variable is called the auxiliary variable. Hence, usually the marginal distribution of $X$ is known, or $E(X)$ is available. In that case, we may choose $\hat\mu_X = E(X)$, and the estimator in (4.1) becomes

\[ \hat\mu_Y = E(X)' (A'V^{-1}A)^{-1} A'V^{-1} Y, \]

which is similar to $\hat{\bar y}_{w,v}$ in Section 3.

Let $X = (Z_1, Z_2, \ldots, Z_p)'$, where $Z_1$, the first component of $X$, has the same distribution as the $Z_{1i}$'s, $Z_2$ has the same distribution as the $Z_{2i}$'s, etc. Let $W = g(X) = g((Z_1, Z_2, \ldots, Z_p)')$ be a real random variable. The $W_i$'s are i.i.d. with the same distribution as $W$.

We will impose some mild conditions on the random variables $Z_j$'s, $W$ and $Y$.

(A0) $\hat\mu_X = E(X) + o_p(1)$,

(A1) $E\left( \left| \dfrac{Z_i Z_j}{W} \right| \right) < \infty$, for all $i, j = 1, 2, \ldots, p$,

(A2) $E\left( \left| \dfrac{Z_j Y}{W} \right| \right) < \infty$, for all $j = 1, 2, \ldots, p$,

(A3) $E(W^{-1} X'X)$ is a non-singular matrix.

Before stating our key theorem, let us derive some properties of $\hat\mu_Y$ based on these assumptions.


From (A1) and the Weak Law of Large Numbers,

\[ \frac{1}{n} (A'V^{-1}A) = \frac{1}{n} \sum_{i=1}^{n} W_i^{-1} X_i' X_i = E(W^{-1} X'X) + o_p(1). \tag{4.4} \]

Using this and assumption (A3), it is easy to see

\[ \left( \frac{1}{n} (A'V^{-1}A) \right)^{-1} = [E(W^{-1} X'X)]^{-1} + o_p(1). \]

From (A2) and the Weak Law of Large Numbers,

\[ \frac{1}{n} A'V^{-1}Y = \frac{1}{n} \sum_{i=1}^{n} W_i^{-1} X_i' Y_i = E(W^{-1} X'Y) + o_p(1). \tag{4.5} \]

Combining (A0), (4.4) and (4.5), we have

\[ \hat\mu_Y = \hat\mu_X' \hat\beta_W = [E(X) + o_p(1)]' \left[ \left( \frac{1}{n} (A'V^{-1}A) \right)^{-1} \left( \frac{1}{n} A'V^{-1}Y \right) \right] = E(X)' [E(W^{-1} X'X)]^{-1} E(W^{-1} X'Y) + o_p(1). \tag{4.6} \]

Now we are ready to state our key result.


Theorem 4.1. If assumptions (A0)-(A3) hold, then

\[ \hat\mu_Y = \hat\mu_X' (A'V^{-1}A)^{-1} A'V^{-1} Y \xrightarrow{p} E(Y) \]

if and only if

\[ W = \sum_{j=1}^{p} c_j Z_j \quad \text{for some } c_j\text{'s}. \]


Proof. (i) Sufficiency: From (4.6),

\[ \hat\mu_Y = E(X)' [E(W^{-1} X'X)]^{-1} E(W^{-1} X'Y) + o_p(1) = E(X)' [E(W^{-1} X'X)^{-1} C^{-1}]\,[C\,E(W^{-1} X'Y)] + o_p(1), \tag{4.7} \]

where $C$ is any non-singular matrix with the first row equal to $(c_1, c_2, \ldots, c_p)$. Since $W = \sum_j c_j Z_j$, the first entry of $C\,E(W^{-1} X'Y)$ is $\sum_j c_j E(W^{-1} Z_j Y) = E(Y)$. Note that

\[ C\,E(W^{-1} X'Y) = (E(Y), \ast, \ldots, \ast)', \tag{4.8} \]

where $\ast$ marks entries whose values play no role in the argument, and

\[ E(W^{-1} X'X)^{-1} C^{-1} = [C\,E(W^{-1} X'X)]^{-1} = B^{-1}, \tag{4.9} \]

where $B = C\,E(W^{-1} X'X)$ is a matrix whose first row is $(E(Z_1), E(Z_2), \ldots, E(Z_p))$. From (4.7)-(4.9), we get

\[ \hat\mu_Y = [E(Z_1), E(Z_2), \ldots, E(Z_p)]\,B^{-1}\,(E(Y), \ast, \ldots, \ast)' + o_p(1) = (1, 0, 0, \ldots, 0)\,(E(Y), \ast, \ldots, \ast)' + o_p(1) = E(Y) + o_p(1). \tag{4.10} \]

Therefore, we have proved

\[ \hat\mu_Y - E(Y) \xrightarrow{p} 0. \]

Note that (4.10) is true because

\[ [E(Z_1), E(Z_2), \ldots, E(Z_p)] = (1, 0, 0, \ldots, 0)\,B. \]

(ii) Necessity: Assume that $\hat\mu_Y \xrightarrow{p} E(Y)$ as $n \to \infty$. From (4.6), we know that

\[ \hat\mu_Y \xrightarrow{p} E(X)' (E(W^{-1} X'X))^{-1} E(W^{-1} X'Y). \]

Hence

\[ E(Y) = [E(X)' (E(W^{-1} X'X))^{-1}]\,E(W^{-1} X'Y) = (d_1, d_2, \ldots, d_p) \begin{pmatrix} E(W^{-1} Z_1 Y) \\ E(W^{-1} Z_2 Y) \\ \vdots \\ E(W^{-1} Z_p Y) \end{pmatrix} = E\Big( \sum_{j=1}^{p} W^{-1} d_j Z_j Y \Big), \]

where $(d_1, d_2, \ldots, d_p) = E(X)' [E(W^{-1} X'X)]^{-1}$. Therefore

\[ E\Big( \Big[ \sum_{j=1}^{p} W^{-1} d_j Z_j - 1 \Big] Y \Big) = 0, \quad \text{for any } Y, \]

which implies

\[ \sum_{j=1}^{p} W^{-1} d_j Z_j = 1 \quad \text{with probability 1}. \]

Hence,

\[ W = \sum_{j=1}^{p} d_j Z_j. \]

This completes the proof of Theorem 4.1. □
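The vector $(d_1, \ldots, d_p)$ from the necessity argument can be computed explicitly when the distribution of $X$ is discrete. The sketch below assumes $p = 2$, $X = (1, Z)'$ and an equally weighted distribution on a few made-up support points (all hypothetical choices); it builds the 2x2 matrix of weighted second moments that is the limit of $\frac{1}{n} A'V^{-1}A$ and solves for $d$. When $W = c_1 + c_2 Z$, the solution recovers $(c_1, c_2)$; for $W = Z^2$ the identity $\sum_j d_j Z_j / W = 1$ fails.

```python
def d_vector(zs, weight):
    """Solve M d = E(X) with X = (1, Z)', M = limit of A'V^{-1}A / n, W = weight(Z)."""
    n = len(zs)
    ws = [weight(z) for z in zs]
    m00 = sum(1.0 / w for w in ws) / n
    m01 = sum(z / w for z, w in zip(zs, ws)) / n
    m11 = sum(z * z / w for z, w in zip(zs, ws)) / n
    e0, e1 = 1.0, sum(zs) / n            # E(X) = (1, E(Z))
    det = m00 * m11 - m01 * m01
    d0 = (e0 * m11 - m01 * e1) / det
    d1 = (m00 * e1 - m01 * e0) / det
    return d0, d1


# Hypothetical support points of Z, each with equal probability.
support = [1.0, 2.0, 3.0, 5.0]

d_lin = d_vector(support, lambda z: 1.0 + 2.0 * z)   # W = 1*Z_1 + 2*Z_2 with Z_1 = 1, Z_2 = Z
d_sq = d_vector(support, lambda z: z * z)            # W = Z^2, not linear in (1, Z)
# Largest deviation of sum_j d_j Z_j / W from 1 over the support, for W = Z^2.
max_dev_sq = max(abs((d_sq[0] + d_sq[1] * z) / (z * z) - 1.0) for z in support)
print(d_lin, d_sq, max_dev_sq)
```

Because the moment matrix is symmetric, solving $M d = E(X)$ gives the same $d$ as the row-vector form $d' = E(X)' M^{-1}$ used in the proof.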

As we can see from the proof of Theorem 4.1, the i.i.d. assumption for $\{(X_i, Y_i)\}$ is not necessary. All we need is formula (4.6), which is implied by (4.4) and (4.5). Note that (4.4) and (4.5) hold for simple random sampling without replacement. Hence, Theorem 4.1 is an extension of Theorem 3.1. One possible application of Theorem 4.1 would be in the missing data problem. Suppose that some of the X-components among the sample are missing, and other data may have only the X-component with $y$ missing. In this case, $\hat\mu_X$ may be chosen as a component-wise average of $X$ in the sample.